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ABSTRACT 

Context. The exploration of the relation between galaxy sizes and other physical parameters (luminosity, mass, star 
formation rate) has provided important clues for understanding galaxy formation, but such exploration has until recently 
been limited to intermediate redshift objects. 

Aims. We use the currently available CANDELS Deep+Wide surveys in the GOODS-South, UDS and ECS fields, 
complemented by data from the HUDF09 program, to address the relation between size and luminosity at z ~ 7. 
Methods. The six different fields used for this study are characterized by a wide combination of depth and areal 
coverage, well suited for reducing the biases the observed size-magnitude plane. From these fields, we select 153 z-band 
dropout galaxies. Detailed simulations have been carried out for each of these six fields, inserting simulated galaxies at 
different magnitudes and half light radius in the two dimensional images for all the HST bands available and recovering 
them as carried out for the real galaxies. These simulations allow us to derive precisely the completeness as a function 
of size and magnitude and to quantify measurements errors/biases, under the assumption that the 2-D profile of z=7 
galaxies is well represented by an exponential disk function. 

Results. We find in a rather robust way that the half light radius distribution function of z ~ 7 galaxies fainter than 
J = 26.6 is peaked at < 0.1 arcsec (or equivalently 0.5 kpc proper), while at brighter magnitudes high-z galaxies are 
typically larger than ~0.15 arcsec. We also find a well defined size- luminosity relation, Rh oc L 1/l2 . We compute the 
Luminosity Function in the HUDF and P12HUDF fields, finding large spatial variation on the number density of faint 
galaxies. Adopting the size distribution and the size-luminosity relation found for faint galaxies at z=7, we derive a 
mean slope of —1.7 ± 0.1 for the luminosity function of LBGs at this redshift. 

Conclusions. Using this LF, we find that the number of ionizing photons emitted from galaxies at z ~ 7 cannot keep 
the Universe re-ionized if the IGM is clumpy (Chii > 3) and the Lyman continuum escape fraction of high-z LBGs is 
relatively low (/ esc < 0.3). If these results are confirmed and strengthened by future CANDELS data, in particular by 
the forthcoming deep observations in GOODS-South and North and the wide field COSMOS, we can put severe limits 
to the role of galaxies in the reionization of the Universe. 

Key words. Galaxies: distances and redshift - Galaxies: evolution - Galaxies: high redshift - Galaxies: structure 



1. Introduction 

The advent of the WFC3 instrument onboard HST 
has opened a new window for the study of galaxy 
shape, size, and morphology up to very high redshifts 
(jWin dhorst ct al. 2011). The combination of large area, 
fine resolution, and NIR wavelengths achieved with this 
powerful instrument allows us to study galaxies in the UV 
rest frame at z ~ 7 — 8 with impressive accuracy. 
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An important parameter that is useful for constraining 
different models of galaxy formation and evolution is galaxy 
size, measured through its half light radius (hereafter Rh) . 
This quantity gives us an indication of the dynamical state 
of the galaxy itself and the effects of feedback, minor/major 
merging, inflows and outflows. In particular, the relation 
between size and luminosity, or other physical properties 
(stellar mass, dust extinction, etc.), can give insight into 
the detailed galactic assembly processes. 

With the advent of large surveys, like the Sloan Digital 
Sky Survey (SDSS, Abazajian et al. 2004 ) it has been pos- 
sible to study the physical properties of local galaxies with 
great accuracy. Present-day galaxies show a clear correla- 
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tion between size and stellar mass, with the most massive 
galaxies having the largest half-light radii (jShen et al. 2003, 
Graham & Worley (2008) I. This mass-size correlation has 
been found both for elliptical and spiral galaxies. In par- 
ticular, B arden et al. 20 05 pointed out that the same rela- 
tion holds up to z ~ 1 and its normalization is unchanged 
with respect to local dis k galaxies at least for stella r masses 
M > 10 10 M Q . Recently, [Mosleh, M et al. (2012)| extended 
the evolution of the stellar mass-size relation for star- 
forming galaxies till z ~ 7, finding that the typical size of 
LBGs increases toward lower redshifts, in agreement with 
previous measurements at low-z. 

Star Forming galaxies at z ~ 2 — 3, instead, have 
been extensively studied using ground based spectroscopy, 
HST imaging and IFU observations. Lyman Break Galaxies 
(LBGs, ISteidel fc Hamilton 19931 IMadau et al. 1996ft at 
z ~ 2 — 3 show a stellar mass-radius relation already 
established QNagy et al. 201 1[ ILaw et al. 2012[) . Similarly, 
Lyman-a emitters (LAEs) at the same redshifts present 
a correlation between their sizes and other physical prop- 
erties, such as stellar mass, star formation rate (SFR), 
Spectral Energy Distribution (SED) or dust extinction 
(jBond et al. 2012)) . with larger galaxies having higher stel- 
lar masses, higher dust extinction, and higher SFR. The 
half light radius of these LAEs at z = 2, however, is not 
correlated to the EW in Lyman-a. The stellar mass-radius 
relation evolves in redshift as (l + z) _1 , in a manner consis- 



and [ Ferguson et al. (2004) 
Hathi et al. (2008)|at 



tent with the size evolution found by Bouwens et al. (2004) 

for LBGs at z > 2 — 5 and by 
> - 6. 

found a well-defined 
"and observed magni- 



At z 



6, |Bouwens et al. (2006) 
measured size 



correlation between 
tudes for 332 photometrically selected LBG candidates: 
this indicates that a size-luminosity relation could be still 
in place at high-z. At magnitudes fainter than zab ~ 28 
there is a clear lack of galaxies larger than 0.2 arcsec, but 
at such faint levels, the effect of surface brightness dim- 
ming is limitin g the completeness of large g alaxies. At the 
same redshift, |Dow-Hygelund et al. (2007)| found that all 
the Lyman-a emitters are more compact than average rel- 
ative to the observ ed size-magnitude relat ion of the large 



i-dropout sample of Bouwens et al. (2006) 



The evolution of galaxy size with redshift and luminos- 
ity also has important implications for the faint end of the 
Luminosity Function (LF) and the role of low-luminosity 
galaxies in the reionization of the Universe. One of the 
main motivati ons of this work is to answer to the ques- 
tions raised in Grazian et al. (2011) In our previous work 
we have explored different distributions for the half light 
radius of z = 7 galaxies. One of the clearest results is that 
the LF is quite steep (a ~ —2) if faint galaxies are extended 
(Rh ~ 0.2 — Q.iarcsec at J — 28 — 29) while it turns out 
to be similar to lower-z LFs (a ~ —1.7) if objects become 
smaller at relatively faint magnitudes. This is simply due 
to the corrections for incompleteness at the faint end of the 
LF, which are more severe for large and extended galax- 
ies. A steep LF has deep implications for the number of 
ionizing photons produced by galaxies at z ~ 7: a typical 
LF with a ~ —1.7 provides enough light to maintain the 
reionization process only assuming a large escape fraction 
of Lyman Continuum photons (f esc > 20%), an IGM that 
is not clumpy {Chi i < 4 — 6), and extrapolating this steep- 
ness down to very faint flux levels (Afisoo = —10). These 
constraints are valid under the assumptions of a Salpeter 



IMF and ignoring the effects of PopIII stars or other exotic 
sources of ionizing radiation. On the other hand, if faint 
galaxies are extended, then the resulting LF is quite steep 
(a ~ —2) and galaxies alone are able to keep the Universe 
reionized even for less extreme combinations of escape frac- 
tion and dumpiness (/ esc > 5% and Crii < 30). Thus a 
detailed analysis on the typical sizes of high-z galaxies and 
the relation between galaxy size and luminosity is necessary 
to understand wh ether galaxies alone ar e the responsibles 
for reionization. In Grazian et al. (2011) we did not provide 
a definitive answer to these questions, which therefore moti- 
vated the investigation here with a larger sample of galax- 
ies at z ~ 7. Of course, other hypotheses on the sources 
responsible for reionization at such high-z are possible, like 
a top-heavy IMF, a large contribution from PopIII stars, 
or other sources of ionizing photons i.e. high-z AGNs. 

Throughout this paper, we will assume a "concordance" 
cosmology with Hq = 70km s^ 1 Mpc~ x , CIm = 0.3 and 
f^A = 0.7. In this cosmological model, an angular dimension 
of 1 arcsec corresponds to a physical dimension of 5.227 kpc 
(proper) at z=7. 



2. Data 

2.1. The photometric sample 

We have analysed six different data sets observed with the 
NIR camera of HST, the Wide Field Camera 3 (WFC33): 
the UKIDSS Ultra Deep Survey (UDS, 227 sq. arcmin 
to J=26.7) and the Extended Groth Strip (EGS, 110 
sq. arcmin to J=26.7) from the CANDELS-W ide survey 
flGrogin et al. (2011)1 |Koekemoer et al (2011)| ), the Early 
Release Science (| Windhorst et al. 2011jl online GOODS- 
S field (GOODS-ERS, 40 sq. arcmin. to J=27.4), part of 
the GOODS-South Deep (GDS, 27 s q. arcmin. to J=27.8) 
from the CANDELS-De ep survey ( [Grogin et al. (2011)) 
Koekemoer et al. (2011) I, the first year observations of the 
Hubble Ultra Deep Field (HUDF, 4.7 sq. arcmin. to J=29.2) 
and its para llel field (P12HUDF, 4 7 sq. arcmin. to J=29.2) 
described in Bouwens et al. (2011) All together, they allow 
us to cover a broad enough range of galaxy size and lumi- 
nosity to investigate their interrelationship. 



2.1.1. UDS 

The UDS program is part of the CANDELS/Wide 
survey and it was the first wide field to be ob- 
served by the CANDELS observations in the NIR with 
WFC3. It i s centered on the UKIDSS Ultra Deep 
Survey field (jLawrence et al. 2007P and benefits from the 
ground based imaging with SUBARU in BVRiz bands 
([Furusawa et al. 2008[) . deep JHK imaging from UKIRT- 
WFCAM, deep CFHT U band, MIR observations by 
Spitzer (SpUDS and SEDS programs), and intense follow- 
up optical sp ectroscopy from GeminQ VLT-VIMOS , and 
VLT-FORS2 (ISmail et al. 20081 |Simpson et al. 201 2| ). This 
field has been covered by HST in optical and NIR using a 
mosaic grid of tiles and repeated over two epochs. During 
each epoch, each tile has been observed for one orbit (^2000 
seconds), divided into two exposures in J125 (at a depth of 
1/3 orbit) and two exposures in H160 (at a depth of 2/3 



http:/ /www. stsci.edu/hst/wfc3 
2 http:/ /mur. ps.uci.edu/~cooper/IMACS/home. html 
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orbit), together with parallel exposures using ACS/WFC 
in V606 and 1814. Observations and data reduction for the 



UPS field are described in detail in Koekemoer et al. (2011) 



and Grogin et al. (2011) 



The total area covered by the CANDELS/UDS is - 227 
sq. arcmin. down to J=26.7, and H=26.5 magnitudes at 5<t 
in a circular aperture of ~ 0.11 arcsec 2 (corresponding to 
2 times the FWHM of the NIR images). 



2.1.2. EGS 



The EGS (iDavis et al. 2 007) field is part of the 
CANDELS/Wide survey. It covers a region of the sky 
that has been extensively studied by the All-wavelength 
Extended Groth strip International Survey (AEGIS). This 
field has been observed with ACS in the V606 and 1814 
bands and by a number of ground based facilities from the 
U to the K band. These data have been complemented by 
observations with the Spitzer Space Telescope in the 3.6- 
70/im range, in X-ray by Chandra, in FUV by GALEX and 
in radio by VLA. The CANDELS/Wide observations used 
here are only the first half of the area that will be covered 
at the completion of the survey, namely a rectangular grid 
of 6.7 x 30.6 sq. arcmin. The WFC3 observing strategy in 
the EGS mirrors that adopted for the UDS, covering a to- 
tal area of ~ 110 sq. arcmin. down to J=26.7, and H=26.5 
magnitudes at 5er in a circular aperture of ~ 0.11 arcsec 2 
(corresponding to 2 times the FWHM of the NIR images). 
An overview of observations available for the EGS field is 



described in detail in Grogin et al. (2011) 



2.1.3. ERS 

The GOODS-ERS WFC3/IR observations comprised 60 
HST orbits consisting of 10 contiguous pointings in the 
GOODS-South field (HST Program ID 11359), using 3 fil- 
ters per visit (Yogs, J125, -Hi6o), and 2 orbits per filter (for a 
total of 4800-5400s per pointing and filter). The total area 
covered by the GOODS-ERS is ~ 40 sq. arcmin. down to 
Y=27.3, J=27.4, and H=27.4 magnitudes at 5er in an area 
of ~ 0.11 arcsec 2 (corresponding to 2 times the FWHM of 
the images). 

A full description of the imaging dataset for the ERS 
field and of the redu ction procedures adopted is given in 
Grazian et al. (2011) 



2.1.4. GDS 

The CANDELS-Deep survey will cover 125 sq. ar- 
cmin. to ~ lOorbit depth on the central region of 
the GOODS-South and G oods-North fields at completion 
( Koekemoer et al. (2011)j ). At this stage, only 6 WFC3 
pointings (~ 27 sq. arcmin.) in the GOODS-South cen- 
tral region are available, with a 3 orbit depth in the Y105 
band and ~ 2.5 orbits both in J125 and in i?i6o bands. In 
the following, we will refer this field to Goods Deep Survey 
(GDS). We added these images to our database since they 
reach a slightly deeper magnitude limit (J = 27.8 at 5ct in 
a circular aperture of ~ 0.11 arcsec 2 ) than the ERS in a 
comparable area, starting to investigating the region of the 
parameter space dealing with faint and extended objects: 
since they are faint, it is not possible to detect them on wide 



surveys, i.e. the UDS, but they are very rare in ultra-deep 
pencil beam surveys (i.e. HUDF). 

When completed, the GDS survey will open new fron- 
tiers on the size-luminosity studies of high redshift galaxies, 
thanks to its unique combination of both area (125 sq. ar- 
cmin.) and depth (J = 28) in the NIR wavelength range. 

2.1.5. HUDF 

The first year HUDF09 WFC3/IR dataset (HST Program 
ID 11563) is a total of 60 HST or bits, observed in 
September 2009, in a single pointing (|Oesch et al. 20101 
IBouwens et al. 2010[) in three broad-band filters (16 or- 
bits in Y105, 16 in J125, and 28 in i?i6o)- It is one of the 
deepest NIR images ever taken, reaching Y=29.3, J=29.2, 
and H=29.2 total magnitudes for point like sources at 5 
sigma (this S/N ratio is computed in an aperture of ~ 0.11 
arcsec 2 , corresponding to two times the FWHM of the NIR 
images). The area covered by the WFC3-HUDF imaging is 
4.7 sq. arcmin., and the IR data have been drizzled to the 
ACS-HUDF data (jBeckwith et al. 2 006). with a resulting 
pixel scale of 0.03 arcsec and a FWHM of 0.18 arcsec. 

As for the ERS, a full description of this dataset 
and of the reduction procedures adopted are described in 
Grazian et al. (2011) 



2.1.6. P12HUDF 

The HUDF09 program consists of deep observations in 
three different fields (HUDF, P12, P34) on the GOODS- 
South region. They have been already discussed in 



Bouwens et al. (2011) Here we use only data from the P12 
(hereafter P12HUDF) region to double the number statis- 
tics at the faint end of the size-luminosity relation at z ~ 7. 
The P12HUDF field has been observed in the V606, -^775 > and 
Z S50 bands by ACS (jOesch et al. 2007]) and in the Y W5 



J125, and flieo bands by WFC3 {Bouwens et al. (2011) I 
down to a magnitude limit of ~ 29.1 in the VIZ filters and 
~ 29.2 for the YJH ones. These data have been reduced 
using the same approach described above for the GDS and 
HUDF fields. 



2.2. Photometry and Size of faint galaxies 

The photometric catalogs of the UDS, EGS, ERS, GDS, 
P12HUDF, and HUDF fields have been derived in a 
consistent way. Galaxies have been detected in the J125 
band (corresponding to rest frame 1500 A at z=7), and 
their total magnitudes have been computed using the 
MAG_BEST of SExtractor (jBertin fc Arnouts 1996|) . 
using DETECT_MINAREA=9 pixels and 
DETECT_THRESH=ANALYSIS_THRESH=0.7. Colors 
in BVIZY and H bands have been measured running 
SExtractor in dual image mode, using isophotal magni- 
tudes (MAGJSO) for all the galaxies. Since the FWHM of 
the ACS images (0.12") is slightly smaller than the WFC3 
ones, we smoothed the ACS bands with an appropriate 
kernel to reproduce the resolution of the NIR WFC3 
images. This ensures both precise colors for extended 
objects and unbiased photometry for faint sources. Further 
details on the photom etric measurement can be found in 
Grazian et al. (2011) 
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We used the same SExtractor catalog to measure the 
angular sizes of galaxies in the samples. For this purpose 
we have used the half light radius measured by SExtractor 
in the J band. This quantity is the zero-order estima- 
tor of the overall shape of a galaxy that one can adopt, 
and is clearly inadequate to fully describe the whole com- 
plexity of galaxies. However, most of the z ~ 7 galax- 
ies are very faint, close to the detection limit of our im- 
ages: in this case, the more sophisticated estimators that 
can be adop ted to measure gal axy morphology (e.g. the 
CAS system. [Abraha m et al. 19941 or the Gini coefficients, 
ILotz et al. 2004|) cannot be adopted, for lack of adequate 
S/N. 

For this reason, we adopt here the half light radius and, 
hereafter, we will refer to it when generically speaking of 
galaxy sizes. We are aware that, even adopting this simple 
estimator, this approach is still prone to systematic effects. 
These will be addressed with detailed simulations, that are 
described in Sect. 4. 



3. Selecting galaxies at z=7: color criteria 

The selection of galaxies at z ~ 7 uses the well known 
"drop-out" or "Lyman-break" technique. At 6.5 < z < 7.5, 
this feature is sampled by the large Z — Y color, 
as shown by a number of w orks using ground based 
imaging (jOuchi et al. 20091 



Castellano et al. (2010b)]) 
QBouwens fe lllingworth 2006 



Castellano et al. (2010a)] 
and from space 
IMannucci et al. 20071 



IQesch et al. 20091 IQesch et al. 20101 iB unkcr c t al. 20101 

The 

these 



IMcLure et al. 20101 
spectroscopic confirmations 
(shown in IFontana et al. 2 010, 



Bouwens et al. (2011)[ ). 
oT 



candidates 

m Montana et al. zuim IVanzella et al. 20111 
IPentericci et al. 20111 IStark et al. 20111 IQno et al. 20T2]) 
ensures that this technique is robust and the fraction of 
expected interlopers is quite low (less than 20% at z ~ 6, 
as shown in IPentericci et al. 2011|). 



As 



described in 



Koekemoer et al. (2011) for 



Grogin et al. (2011)| 
the 



and 

CAJNDELS data on 
the GOODS deep survey (GDS), the Y105, ^125 > and -H160 
bands are available, as well as in the HUDF and P12HUDF 
fields, while in the wide areas (UDS and EGS) only the 
J125 and Hiqo bands from WFC3 are available. On the 
other hand, in the ERS field the Y098 filter is available 
instead of the Y105 one. This turns out in a slightly different 
selection criteria for these fields. 

For the HUDF, P12HUDF, and GDS fields 
we thus adopted the color criteria discussed in 



Grazian et al. (2011) 



z-Y 105 > 0.8, 

z-Y 105 > 0.9 + 0.75(^05- J125), 

Z-Y 105 > -1.1 + 4.0(^105-^125) 

while for the ERS we adopted the following color criteria: 



Z-Y 09S > 1.1, 
z-Y 09S > 0.55 H 
z - Y 09S > -0.5 



1.25(Y"o98 — ^125), 
- 2.0(V 98 - Jisb) 



to take into account the differences in the transmission of 
Y098 and Y 1Q5 filters. 

For the non-detection in b ands bluer than Z, we 
adopt the same criteria used in Castellano et al. (2010a)| 



Castellano et al. (2010b) and in Grazian et al. (2011) 
{S/N < 2 in all BVI bands and S/N < 1 in at least two of 
them). 

Following the above criteria, we select 20 candidates 
in the HUDF field down to J < 29.2, while in the ERS 
we find 22 z-dropout candidates at J < 27.4, respectively. 
Their characteristics are described in Table 1 and 2 of 



Grazian et al. (2011)| For the GDS and P12HUDF we re- 
cover 21 and 23 galaxies down to J < 27.8 and J < 29.2, 
respectively. 

For the UDS and EGS fields, where the only photometry 
available from space is in Veoe, ^814, ^125, and -H160 bands, 
we adopt the 7si4-dropout color selection, which gives a 
more extended redshift window for selecting galaxy can- 
didates (6.4 < z < 8.5). In particular, the color criteria 
adopted are: 



^814 
^814 



J125 > 2.0, 

Jl25 > 1-4 



2.5(Ji25 — #160 ) 



with J125 < 26.7 and non detected in the V<306 band 
{S/N{Veo6) < 1-5), which is satisfied by 46 galaxy can- 
didates in the UDS and 21 in the EGS (the present obser- 
vation of this field covers only half of the expected FoV). 

In appendix A we provide the tables with position, J125 
band magnitude and size of the z ~ 7 candidates found in 
the GDS, P12HUDF, EGS, and UDS fields. 

Fig. Q] shows the selection functions in redshift for the 
different color criteria described above. They have been de- 
rived using the simulations described in the next section, 
and applying to the simulated catalogs the same color se- 
lections described above for real dropouts. The selection 
function for the CANDELS wide surveys (UDS and EGS) 
is more extended due to the limited set of HST filters used 
(VIJH) with respect to those used in GOODS Deep or 
HUDF surveys (BVIZYJH). In particular, in the UDS and 
EGS surveys we adopt the isi4 drop-out criterion, which 
selects lower-z candidates than the classical Zs5o drop-out 
galaxies. Indeed, the lack of the HST Y band for UDS and 
EGS results in a redshift selection function which is more 
extended towards higher-z objects. The redshift distribu- 
tions expected for HUDF and for ERS are similar, despite 
the different Y band filters adopted. The mean redshifts of 
all the distributions are around 6.7, indicating that there's 
no difference in the typical redshifts of selected dropout 
galaxies in these six surveys. 

3.1. The reliability of candidates selected on the CANDELS 
Wide surveys 

In the CANDELS wide surveys analysed here (UDS and 
EGS), the reliability of z ~ 7 galaxies candidates can be 
hampered by the limited set of HST images (VIJH) avail- 
able. We have carefully investigated the properties of our 
dropout candidates in the UDS field using both the pho- 
tometric redshift technique based on SED fitting and the 
stacking of HST images of all the candidates to check the 
expected non detection in the optical images. 

To compute the photometric redshifts for our candi- 
dates in the UDS field, we complement the VIJH HST 
photometry with the ground based images available from 
CFHT in the U band, from Subaru in the BVRiz filters 
and from UKIRT in the JHK bands. We added also the 
IRAC photometry both from SpUDS and SEDS programs, 
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-i — i — i — i — | — i — i — 
HUDF P12HUDF GDS 
ERS 

UDS EGS 



f 1 1 



Fig. 1. The selection function in redshift for the different 
color criteria adopted in the six CANDELS and HUDF 
fields. The peaks of the distributions have been normal- 
ized to f to ease the comparison. The selection function 
for the CANDELS wide surveys (UDS and EGS) is more 
extended due to the limited set of filters used (VIJH) with 
respect to the filters used in GOODS Deep or HUDF sur- 
veys (BVIZYJH). 



using the TFIT software to match the resolution of HST 
to the PSF of ground based images or of space based im- 
ages by Spitzer. The adopted technique for the deriva- 
tion of the photometric c atalog is described in detail in 



Galametz et al. (in prep. 

The photometric redshift analysis used to selected 
the high-redshift candidates in the UDS field was per- 
formed using the t emplate-fitting code developed by 
McLure et al. (2011) For the purposes of this study we 
employed the IBruzual fc Chariot 20031 stellar evolution 
models, with metallicities ranging from solar (Zq) to 
l/50th solar (O.O2Z ). Models with instantaneous bursts of 
star-formation, constant star-formation and star-formation 
rates exponentially declining with characteristic timescales 
in the range 50 Myrs < t < 10 Gyrs were all consid- 
ered. The ages of the stellar population models were allowed 
to range from 10 Myrs to 13.7 Gyrs, but were required 
to be less than the age of the Univ erse at each redshift. 



Dust reddening was described by the Calzetti et al. (2000) 
attenuation law, and allowed to vary within the range 
0.0 < Ay < 2.5 magnitudes. Inter-galactic me dium absorp- 
tion short- ward of Lya was described by the IMadau 19951 
prescription, and a Chabrier IMF was assumed in all cases. 
For each model of this grid, we have computed the expected 
magnitudes in our filter set, and found the best-fitting tem- 
plate with a standard \ 2 normalisation. 

Within the UDS sample, only 27 galaxies have a robust 
photometric redshift z > 6.8, while the remaining galax- 
ies are either consistent with having a slightly lower value 
(6.3 < z < 6.8), or have two comparable solutions for the 
photometric redshifts (one at z ~2 and the other at z ~ 7). 
These 27 candidates are part of another work on the lu- 
minosity function of bright z-dropout LBGs and will be 



described in detail in McLure et al. (in prep.) with a full 
description of the photometric redshifts used here. We de- 
cided to keep the full sample of 46 candidates for UDS in our 
work, after checking that they have comparable properties 
(both in luminosity an d half light radius) to the sub-sample 
of the 27 candidates of McLure et al. (in prep.) We repeat 
the same check on EGS, and we find that 13 galaxies out 
of 21 have zphot > 6.8. 

FigUl shows the weighted mean of the 46 Isi4-dropout 
candidates in the UDS field for Veo6, ^814> ^125, and i?i6o 
bands. The stack image has no detection in the Vqog band, 
and a very faint detection in the /si4 band. The fit to the 
SED of the stack indicates that the photometric redshift is 
consistent with zphot ~ 6.5 — 7.5 and no secondary peak at 
z ~ 1 — 2 is present. Restricting the sample to galaxies with 
magnitude J125 < 26.2 and angular size Rh > 0.2 arcsec, 
we obtain essentially the same stacked SED and photomet- 
ric redshift probability as that for the whole sample. This 
indicates that the 46 candidates in the UDS field are robust 
against contamination by low-z interlopers, and bright and 
extended Isi4-dropouts are not dominated by foreground 
sources mimicking our color criteria. In the other fields, 
where the depth of HST images are similar (EGS) or deeper 
(ERS, GDS, HUDF, P12HUDF), we expect as well a re- 
duced contamination by lower-z sources. 



4. Simulations for estimating completeness and 
systematic effects 

While the selection criteria described above are formally de- 
signed to select a pure sample of high-z candidates, they are 
in practice applied to very faint objects, typically close to 
the limiting depth of the images. At these limits, systemat- 
ics may significantly affect their detection and the accurate 
estimate of their colors or apparent dimensions. To take 
into account all the systematic effects (completeness, photo- 
metric scatter, size scatter) involved in the size-luminosity 
relation, we carried out a set of detailed simulations. 

These simulations have two main goals. The first is to 
estimate the incompleteness that affects the detection of 
faint galaxies as a function of their size. The incompleteness 
arises not only from the difficulty of detecting faint sources, 
but also from the effect of noise in the many bands that we 
need to select high-z candidates. As expected, we become 
severely dominated by incompleteness as we attempt the 
detection of extended sources at the faintest magnitudes, 
and this effect must be taken into account when trying to 
infer the intrinsic size-luminosity relation. 

The second goal is to quantify the systematic bi- 
ases in the measure of the half-light radius provided by 
SExtractor, as a function of magnitude and galaxy size. 
These systematic biases affect the estimate of the distribu- 
tion of galaxy size in a non-negligible way, and therefore 
also need to be treated carefully in the analysis. 

In this section we describe in some detail the simula- 
tions adopted to estimate these systematic effects and the 
technique that we adopt to include them in the derivation 
of the correct size distribution. The reader directly inter- 
ested in the observational results may skip this section and 
proceed to Sect. 5. 
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Fig. 2. Top: the image stacking of the 46 Jsu-dropout can- 
didates in the UDS held for Vqo6, hi4, Ji2b, and Hiqq bands. 
Middle: the VIJH photometry of this image stacking (black 
dots) has been fitted with an extended library of synthetic 
galaxies, and the best fit (blue line) is consistent with a 
photometric redshift z > 6.5, with no secondary peak at 
z ~ f — 2. The model templates are computed with the spec- 
tral synthesis models BC03 (jBruzual &: Chariot 2003|l . and 
chosen to broadly encompass the variety of star-formation 
histories, metallicities and extinction of re al galaxies, as 
described in detail in McLure et al. (2011) At z=6.5 the 
Lyman continuum absorption is at an observed wavelength 
of A = 6840 A, and a faint emission in the isi4 filter can be 
still compatible with a partially neutral IGM at this red- 
shift. Bottom: The continuous curve is the reduced x 2 as a 
function of photometric redshift z, while the dotted line is 
the probability P(z) rescaled to have the peak at the same 
level of x 2 , to improve visibility. 



4.1. Completeness 



We follo w the procedure described in|Grazian et al. ( 2011 
and in Castellano et al. (2010a)| Castcllano et al. (2010b 
to obtain a realistic catalog of simulated high redshift 
sources. We first produce a set of UV absolute magni- 
tudes and redshifts ( Mi5qq,z) according to an evolving 
Luminosity Function by Castellano et al. (2010b) and then 
convert it into a set of predicted magnitudes using the 



BC03 models with a range of ag es, metallicities, dust con- 
tent and Lya emission as in in Grazian et al. (2011) We 
have also added the interg alactic (IGM) abs orption using 



the average evolution as in Fan et al. (2006) The redshift 
range adopted for the simulated galaxies in all the fields 
is 5.5 < z < 8.5 while the absolute magnitudes A/1500 run 
from -17 to -23. The input distribution for the axial ratio 
parameter b/a is assumed flat from to 1. 

For each simulated galaxy, an input half-light radius 
has been assigned by selecting at random from a uni- 
form distribution between 0.0 and 1.0 arcsec. The two- 
dimensional profiles adopted during the simulations are 
typical of disk galaxies, i.e. an exponentials. In Appendix 
B we show the results of our simulations under the as- 
sumption o f a Sersic profile of i ndex n = 4. Previous 
works (e.g. Ferguson et al. (2004) ) dealing with the size- 
luminosity relation at lower-z, assumed a mix of morpho- 
logical models, with a fraction of 70% spiral and 30% el- 
lipticals at z=3-4. From the observational point of view, 



Ravindranath et al. (20 06) find that 40% of the LBGs at 
2.5 < z < 5 have light profiles close to exponential, as seen 
for disk galaxies, and only ~30% have high Sersic index, as 
seen in nearby spheroids. They also find a significant frac- 
tion (~30%) of galaxies with multiple cores or disturbed 
morphologies, suggestive of close pairs or on-going galaxy 
mergers. Using the ultradeep images of the HUDF in BVIZ 
bands by ACS, Hathi et al. (2008b) found that the sum of 
the images of all LBGs selected at 4 < z < 6 are well 
fit by Sersic profiles with an index n < 2, indicating that 
these galaxies follow a disk-lik e profile in their ce ntral re- 
gion, as recently confirmed by Fathi et al. (2012) (but see 



Graham & Guzman (2003) for a different interpretation). 
Moreover, recent results from spectroscopically confirmed 



LBGs at z < 3 QNagy et al. 20111 ILaw et al. 20L2| and 
z ~ 5 (Douglas et al. 20101 show that the light profile of 



these galaxies are not represented by an elliptical morphol- 
ogy, and the brightest galaxies are typically described by an 
exponential disk. Based on these considerations, from now 
on we assume that galaxies at z ~ 7 are typically approx- 
imated by an exponential disk morphology and that there 
is no galaxy with an r 1 / 4 profile at such high-z. Though 
this is only a rough approximation, there are indications, 
at redshifts lower than 7, that it is a realistic assumption. 

The synthetic galaxies are placed at random positions 
in the real 2-dimensional FITS images, avoiding positions 
where where real galaxies or stars are observed. To this aim, 
the segmentation image created by SExtractor during the 
detection of real objects in the observed J band has been 
used for each field. To avoid an excessive and unphysical 
crowding in the simulated images, we have included only 
200 objects of the same flux and morphology each time. We 
then perform the detection in the synthetic images using 
SExtractor with the same parameters adopted for the real 
images. We simulate all the available bands, i.e. from the 
B to the H bands for the ERS and HUDF fields while for 
the UDS and EGS fields we have simulated only the VIJH 
filters from HST observations. We repeated the simulation 
until a total of at least 5 x 10 5 objects were tested for each 
of the six fields described above. 

These simulations first provide the detection complete- 
ness as a function of the input half light radius and input 
total magnitudes. The typical output is shown in FigO 
where we plot the measured total magnitude J out and half- 
light radius Rh out in the case of the UDS field. Small dots 
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Fig. 3. The observed J magnitude vs size of simulated 
(small black dots) and observed galaxies (red triangles) at 
z ~ 7 for the UDS field. The solid blue line shows the 50% 
completeness level for an input simulated profile of disk 
galaxy, while the dashed green line is the same limit for a 
Sersic profile with n=4. 



show the position of the simulated objects, while big red 
triangles are the observed z ~ 7 LBG candidates. Since the 
simulated input catalog contains objects of all sizes up to 1" 
in equal proportions, Fig|3] clearly shows that, at a given 
magnitude, the fraction of detected objects drops above 
some critical size. We use these simulations to compute the 
50% completeness threshold as a function of magnitude, 
which is shown as a blue solid line in FigJ3J 

We repeated the same analysis for all the six fields in 
our survey and we show the output in Fig |B.3l of Appendix 
B, for the EGS, ERS, GDS, P12HUDF and HUDF fields, 
respectively. Results are qualitatively very similar, the only 
major difference being that the 50% completeness curve 
shifts to fainter magnitudes on the deeper fields, as ex- 
pected. 

We also explored the effects of our assumption for the 
galaxy surface-brightness profiles. We repeated the analysis 
adopting as input for our simulations a Sersic profile with 
index n = 4, and show the completeness obtained with 
this profile in Fig|3] (dashed line), comparing it with the 
previous result for an exponential profile (solid line). As 
expected, the 50% completeness threshold occurs at lower 
radii (for a given magnitude) with the elliptical profile than 
with the exponential one. It is worth noting that the ellipti- 
cal profile cannot explain the presence of galaxies brighter 
than J — 26 and larger than 0.3 arcsec in half light ra- 
dius that we are finding in the UDS field. It is useful to 
stress that the completeness due to a Sersic profile with 
index n = 4 carried out here is only a simple exercise and 
in the following sections of this paper we make the rea- 
sonable assumption that all the z ~ 7 candidates found in 
the CANDELS and HUDF fields have an exponential disk 
profile. 

A second output of our simulations is the estimate of the 
biases in the measurement of the half-light radius as a func- 
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Fig. 4. The results of our simulations for z ~ 7 galaxies 
with exponential disk profiles in the UDS and HUDF fields. 
The plot shows the comparison between the half-light ra- 
dius (in arcsec) as measured by SExtractor as a function of 
the input one. The upper and lower panel refer to different 
total magnitudes in two different simulations, as reported 
in the legend. The red line shows the identity relation. The 
blue points and errorbars show the average value and the 
relevant r.m.s. of the output half-light radius. At small sizes 
the output half light radius is typically larger than the in- 
put one due to the convolution with the instrumental PSF 
carried out during the simulations. Note that the limits of 
the top panel are different from the ones in the middle and 
bottom panels. 



tion of luminosity and size. We show the results in FigfJ] 
where we plot the measured Rh as a function of the input 
one, in three magnitude ranges. The upper and middle plots 
refer to the UDS field, while the bottom plot describes the 
same simulation for the HUDF field. The upper panel shows 
the result for relatively bright objects (21 < J < 24). While 
no z ~ 7 galaxies are found in this magnitude range, it is 
instructive to see that systematic effects are important even 
at the bright end. As expected, the measured size of the ob- 
jects cannot be smaller than the instrumental PSF (which 
is about 0.18" of FWHM in the J band, corresponding to 
an half light radius of 0.11"). Because of the convolution 
with the PSF, all objects intrinsically smaller than ~ 0.2" 
are biased high by the SExtractor estimate. Above 0.4", 
the opposite starts to occur. This is due to the detection 
algorithm adopted. In SExtractor an object is detected only 
if a given number of pixels have an intensity above a de- 
fined threshold, and the half light radius is computed using 
only those pixels. For faint and quite extended galaxies, the 
surface brightness far from the center is very low and the 
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galaxy merges into the noise, with the effect of underesti- 
mating both the total flux and the effective size for these 
objects. The middle and bottom panels of Fig0] show the 
same simulations carried out in two magnitude ranges that 
are critical for the aims of this work (note that the half 
light radius range in the x and y-axis is different from the 
one in the top panel). 

The combined effect of the two systematic biases de- 
scribed above (on the detection and on the estimated size) 
has a measurable impact on the observed distribution of 
galaxy sizes. An example is given in FigJSJ We assume that 
the real input distribution is a log-normal with mean half 
light radius of 0.6 arcsec and a ubh of 0.5 (black histogram) 
which has a mode (peak) at 0.2 arcsec. These values are not 
representative of the best fitting parameters for the UDS 
half light radius distribution, but they are randomly chosen 
just to provide an example with an extended tail, to show 
the selection effects at large Rh. 

The distribution of the measured sizes (red histogram) 
of the simulated objects deviates from the input distribu- 
tion in two respects. At small sizes, the distribution is trun- 
cated below 0.1-0.15" and the peak is moved to a slightly 
larger value, because of the convolution with the PSF. At 
large sizes, a clear cut above half light radius of 0.4 arcsec 
is evident, due to the detection incompleteness discussed in 
FigGH We also note that when the color criteria for z ~ 7 are 
applied, the amplitude of the histogram is reduced but its 
shape is not changed (magenta distribution) , implying that 
the color selection is not significantly affecting the half-light 
distribution of the simulated objects. The Rh distribution 
for real objects, scaled in normalization to match the num- 
ber of simulated galaxies, is shown by the blue histogram 
in Fig|5] 



4.2. Finding the best fit to the observed distribution of 
galaxy half-light radius 

We finally use our simulations to recover the true size dis- 
tribution of z=7 galaxies, under the assumption of a par- 
ticular functional form. We parameterized this as a log- 
normal function in half light radius, with the two parame- 
ters Rh moc i e and obm, that are independent from the input 
luminosities for each simulated sample. This assumption is 
based on the fact that the observed size distribution at dif- 
ferent rcdshifts is characterized by a well-defined peak and a 
tail towards more extended objects (e.g. |Nagy et al. 2011| ). 
The log-normal functional form naturally fits this shape, 
and it is characterized by two parameters, Rh moc [ e and crjih, 
the peak and the dispersion of the log-normal half light ra- 
dius distribution. Moreover, from the theoretical point of 
view, the log-normal function is expected as a natural dis- 
tribution for galaxy sizes QMo, Mao fc White 1998] ). 

In order to recover the intrinsic shape of the distribution 
we have adopted a Maximum Likelihood (ML) approach. 
For any choice of the free parameters of the log-normal 
function (Rh mo( i e and (TRh), the resulting intrinsic distri- 
bution is first convolved with the observational biases (as 
described above, see FigJS]) to get the expected number of 
sources as a function of the measured half light radius. The 
total number of observed galaxies has been matched to the 
simulated one. Then, a Poisson likelihood is computed com- 
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Fig. 5. The size distributions of simulated and observed 
galaxies at z ~ 7 for the UDS field. The black histogram 
represents the input log-normal function with mean half 
light radius of 0.6 arcsec and a URh of 0.5, resulting in a 
peak at 0.2 arcsec (mode). This is not the best fit for the 
observed size distribution, but only an example to show the 
effects of incompleteness at large half light radii. The input 
galaxies are simulated down to a magnitude of J = 26.7, 
which is the nominal limit for the UDS field. The red his- 
togram is the size distribution of all galaxies detected in the 
J band UDS image, irrespectively of magnitudes and col- 
ors, while the magenta histogram shows only the galaxies 
at z ~ 7 selected by our color criteria. Last, the blue dis- 
tribution represents the observed galaxy sizes in the UDS, 
scaled in normalization to match the number of simulated 
galaxies. 

paring the simulated and observed Rh distributions, using 
the usual formula: 



£=n< 



-N. 



(AT .\N ob! 

(N obSti )l 



(1) 



where N b s ^ is the observed number of sources in the half 
light radius interval i, N exp ^ is the expected number of 
simulated sources in the same half light radius interval, and 
Ilj is the product symbol. 

Because the observed J-band magnitude range of our 
z ~ 7 candidates in the six CANDELS fields is roughly two 
magnitudes, and even larger for HUDF and P12HUDF, we 
limit both the observed and the simulated samples to a 
relatively small magnitude interval for the maximum like- 
lihood computation. For each field, we use only galaxies 
(both observed and simulated) in the magnitude range 
25.6 < J < 26.6 to compute the likelihood for bright galax- 
ies; for the intermediate magnitude bin we limit the inter- 
val to 26.6 < J < 27.6, while for the faintest bin we adopt 
27.6 < J < 28.6 as limits. Even though a size-luminosity 
relation holds at z ~ 7, as we will show later in this paper, 
this choice ensures that the change in Rh moc i e inside each 
analysed magnitude interval is small. 

For each combination of the two parameters Rh mo d e and 
<jRh , the peak and the dispersion of the log- normal half light 
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Fig. 6. The best fit for the size distribution of the observed 
galaxies at z ~ 7 in the magnitude range 25.6 < J < 26.6 
for the UDS field. Top: the cumulative distribution for the 
observed galaxy sizes (black histogram) is compared with 
the same for the simulated galaxies (blue curve), and a K- 
S p-value is computed. Bottom: The observed (black) and 
simulated histograms (blue), as described in the top panel, 
are compared with a Maximum Likelihood approach. 
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Fig. 7. The confidence levels obtained with the maximum 
likelihood approach for the size distribution on the mag- 
nitude interval 25.6 < J < 26.6. The best fit is indicated 
by the magenta point while the green, blue, and red re- 
gions define the uncertainties at 68%, 95%, and 99.7% (1,2, 
and 3 sigma) confidence level, respectively. The o~Rh is ba- 
sically unconstrained, while the parameter Rh mo d e is well 
constrained between 0.14 and 0.22 arcsec at 1 sigma. 



radius distribution, we compute the maximum likelihood as 
described in Eq[TJ A typical example is provided in Fig[51 
that shows the best fit distribution resulting from compar- 
ing observations and simulations in the UDS field, using 
z ~ 7 galaxies in the magnitude range 25.6 < J < 26.6. We 
plot both the differential (bottom) as well as the cumula- 
tive size distributions (top) . This figure shows the observed 
distributions of half light radii (black segmented line) and 
a comparison is made with the cumulative distributions re- 
sulting from the simulations discussed above (blue curve) 
after all the systematic effects are taken into account, es- 
pecially the convolution of the intrinsic galaxy shape with 
the observed PSF. We compute the Kolmogorov-Smirnov 
p-value by comparing the observed and the simulated cu- 
mulatives. Since the K-S test is not sensitive to the details 
of the distribution, we prefer the Maximum Likelihood ap- 
proach (bottom panel), as we discussed above, to find the 
best agreement between simulations and observations. 

For each of the six fields we scan a grid in two param- 
eters, Rhmode and ctr^. The grid extends from 0.01 to 0.4 
arcsec in Rh moc i e (corresponding to an interval of 0.02-1.0 
arcsec in Rh rnean ) and from 0.02 to 1.0 arcsec in <jrhi for a 
bin size of 0.01 arcsec in Rh mo de and 0.02 in o~Rh- The choice 
of parameters (Rh mo d e , o~Rh) that maximizes the likelihood 
is considered as the best fit for the real input distribution. 
To improve the signal-to-noise ratio of the fitting procedure, 
we combine the likelihoods of the fields within the same J- 
band magnitude range. In addition to the best fit values, 
the ML approach allows us to define the allowed confidence 
intervals on the free parameters Rh mo de and ctr/j. 

Fig E] shows the likelihood distribution for the combined 
UDS, EGS, ERS and GDS fields for galaxies in the magni- 
tude range 25.6 < J < 26.6. The best fit is indicated by the 



magenta point while the green, blue, and red regions define 
the uncertainties at 68%, 95%, and 99.7% (1,2, and 3 sigma) 
confidence level, respectively. In Fig^\Rh mo d e and o~Rh rep- 
resent the intrinsic parameters of the input log-normal dis- 
tribution before convolving it with all the observational ef- 
fects (PSF convolution, noise, detection and size measure- 
ments). Fig lC.ll in Appendix C shows the same plot for the 
other J band intervals, namely the combination of all the 
fields for the intermediate magnitude bin 26.6 < J < 27.6, 
and the combined GDS, P12HUDF and HUDF fields for the 
faintest bin 27.6 < J < 28.6, respectively. From this statis- 
tical analysis we derive the best fit values Rh mo d e and URh 
used in the following, together with their 68% confidence 
level intervals. 

In FigJ7]the peak of the half light distribution (Rh mo d e ) 
for the bright sample (25.6 < J < 26.6) is well constrained 
by the present observations, while it is not possible to put 
reasonable limits for the spread of the size distribution 
(o~Rh)- The reason for this behavior is clear going back to 
FigJS] input distributions with extended tails like the black 
histogram, after the convolution with simulated effects, are 
affected by a strong incompleteness at large sizes (red and 
magenta histograms) and thus the present data in the UDS 
and EGS fields, even after detailed comparison with simula- 
tions, cannot provide stringent constraints to the extensions 
of the intrinsic distributions in size. 

5. Results 

5.1. Faint galaxies are always small 

First of all, we present the relation between galaxy half light 
radii and fluxes for our z ~ 7 sample. FigJS] summarizes the 
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Fig. 8. The observed J magnitude vs size of candidate 
galaxies at z ~ 7 for all the six fields analysed. The black 
circles indicates galaxies in the UDS field, while crosses are 
associated to objects in the EGS field. The solid lines show 
the 50% completeness levels of the six different surveys for 
an input simulated profile of exponential disk galaxy. The 
galaxy size (Rh out ) is the observed half light radius in arc- 
sec measured by SExtractor and it is not deconvolved by 
the PSF. 



result for the observed size-magnitude distribution at z ~ 7 
for all the six fields investigated in this work. The solid 
lines show the 50% completeness levels of the six different 
surveys for an input simulated profile of exponential disk 
galaxy, as described above. The galaxy size (Rh out ) is the 
observed half light radius in arcsec measured by SExtractor, 
without any attempt to deconvolve it for the effect of the 
PSF. 

It is interesting to note in this plot the lack of galaxies in 
the region of large sizes but faint magnitudes (or low surface 
brightness) objects, namely fainter than J=26.6 and larger 
than 0.2 arcsec (corresponding to a physical dimension of 
1.15 kpc at z=7). Two notable exceptions are 2 galaxy can- 
didates, one in the GDS and the other in the P12HUDF 
field. Visual inspection of these two objects indicates that 
they are both close to bright and extended galaxies, thus 
their magnitudes or sizes could be affected by the pres- 
ence of the bright interlopers. They do not show any evi- 
dence of merging or clumpy morphology. We included these 
two galaxies in the Maximum Likelihood analysis described 
above. Excluding these two objects from FigUl we detect a 
clear lack of faint and extended galaxies at z ~ 7, corrobo- 
rated by very large statistics (153 candidates over 6 fields) 
and down to very faint magnitude limits (J ~ 29). Limiting 
our analysis to 26.6 < J < 27.6, where the lack of extended 
galaxies is evident and where we are reasonably complete 
with our survey, we have 41 galaxies. Moreover, as shown 
in FigfJ] (bottom panel), in this magnitude range the true 
value of the galaxy size is not particular underestimated, 



and from our simulations we expect to find exponential disk 
galaxies with Rh — 0.3 — 0.4 arcsec, if they exist. None of 
them have been found on HUDF and P12HUDF. 

From the best fit results of Fig[7] and Fig lC.ll in 
Appendix C it is clear that the parameter urh, which reg- 
ulates the amplitude of the log-normal distribution in size, 
is basically unconstrained in almost all the fields (see also 
Table [l}. It is also worth noting that in Fig lC.ll we have a 
marginal indication (68% confidence level) that the param- 
eter ubm is less than 0.14 arcsec on the magnitude interval 
26.6 < J < 27.6. If confirmed, this could strengthen the evi- 
dence of a lack of extended galaxies at z=7 in the faint lumi- 
nosity regime ( J ~ 27 — 28). Conversely, as shown in FigJH 
at magnitude brighter than J = 26.6 (i.e. A/1500 ~ —20.5 
for z=7) galaxies can be as extended as Rh = 0.4 arcsec, or 
2.3 kpc physical, while fainter than this limit the sizes are 
always less than 0.2 arcsec. 

5.2. The best fit log-normal distribution 

The visual impression derived from FigjS] is confirmed by 
the robust analysis with the ML method. Adopting the for- 
malism described above, we fit the observed size distribu- 
tions in the six fields with a log-normal function, taking 
into account all the observational biases. Results are sum- 
marised in FigEl where we plot the peak (mode) half light 
radius (of the intrinsic distribution) at different luminosi- 
ties of the z ~ 7 galaxy candidates in each survey. The 
brightest point refers to galaxies in the magnitude range 
25.6 < J < 26.6, while the middle point at Mjjv ~ —19.5 
is the result of the joint Maximum Likelihood using ob- 
jects with 26.6 < J < 27.6. In the faint bin we combine 
the Likelihood values for the GDS, HUDF, and P12HUDF 
fields at 27.6 < J < 28.6. The vertical error bars are the 
ler uncertainties on Rh mo d e found with the simulations de- 
scribed above, while the horizontal error bars show the 
minimum-maximum range in luminosity of the sample. The 
results for each magnitude interval are summarized in Table 

ED 

Overplotting simple power-law relations in Fig|9l it is 
clear that there is a strong dependence of the size on lu- 
minosity, Rh oc L 1 / 2 or Rh oc L 1 / 3 . We cannot exclude 
however that the size distribution is a constant function 
(Rh ~ 0.16 arcsec) at L > L* and then there is a cut- 
off to 0.08 arcsec (=0.4 kpc at z=7) for fainter galaxies, 
rather than being represented by a smoother trend with 
luminosity. The point at Mjjv — —18.5 deserves also par- 
ticular attention: it is the combination of two fields in 
particular, the HUDF and P12HUDF, since the five GDS 
galaxies at J ~ 27.6 are not able to provide strong con- 
straints to the size distribution. If we check the results of 
the two ultradeep fields separately, we find two different 
best fit values, Rh mo d e = 0.09 ± 0.02 arcsec for HUDF and 
Rhmode = 0.03 ± 0.03 for P12HUDF. The differences be- 
tween these two samples are also evident in Fig[8j where 
the typical galaxies in the HUDF have a larger observed 
Rh with respect to the ones in the parallel field. The two 
inconsistent results can be due to the small volumes cov- 
ered by these two HST pointed observations, which can be 
affected by a strong field to field variation. Larger areas 
covered by HST at a similar depth of the HUDF are thus 
required in order to solve the inconsistency of Rh estimate 
at faint magnitude limits. 
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Table 1. Size-Luminosity Results at z ~ 7 



Interval 


N cand 


M uv 




1V1 UV 


Rhmode 


- rti L mode 


Rh up 

^"■mode 




low 
°Rh 


up 












(arcsec) 


-la 


+ 1(7 


(arcsec) 


-la 


+ la 


25.6 < J < 26.6 


63 


-20.5 


-21.0 


-20.0 


0.16 


0.14 


0.22 


0.36 


0.34 


0.98 


26.6 < J < 27.6 


36 


-19.5 


-20.0 


-19.0 


0.09 


0.07 


0.11 


0.06 


0.04 


0.14 


27.6 < J < 28.6 


32 


-18.5 


-19.0 


-18.0 


0.08 


0.04 


0.10 


0.04 


0.02 


1.00 


26.6 < J < 28.6 


68 


-19.0 


-20.0 


-18.0 


0.09 


0.08 


0.10 


0.06 


0.04 


0.10 



N C and is the number of galaxy candidates at z ~ 7 used in this work, with observed magnitudes 25.6 < J < 26.6, 26.6 < J < 27.6, 
and 27.6 < J < 28.6 in the six fields analysed here. The columns M^ v n and M™y x refer to the minimum and maximum in 
absolute magnitudes of the observed sample, while Rh m ™ de and Rh^ ode indicate the la lower and upper limits for the Rh mo de 
parameters, in arcsec. The last two columns indicate the la uncertainties for the parameter aRh, in arcsec. 



-0.4 




Fig. 9. The trend of galaxy size (in arcsec and logarithmic 
scale) with UV absolute magnitude (1500Arest frame) at 
z=7 and the comparison with some simple power-law rela- 
tions. The data point at Mjjv ~ —20.5 is related to galaxies 
in the magnitude range 25.6 < J < 26.6, the one at -19.5 
is derived using objects with 26.6 < J < 27.6, while the 
faint bin at -18.5 is the combination of GDS, HUDF, and 
P12HUDF fields (where the maximum likelihood has been 
computed restricting to 27.6 < J < 28.6). 



The faint bin of the size-luminosity relation in Fig(§] 
is particularly problematic also for another reason. From 
FigHit is clear that in the HUDF and P12HUDF, for galax- 
ies with input Rh < 0.1, the size of galaxies as measured 
by SExtractor is completely dominated by the PSF of HST 
and the resulting Rh(output) is around 0.12 arcsec. This 
feature in the HUDF and P12HUDF fields is present both 
for relatively bright (J ~ 26.5) and for faint (J ~ 28.5) 
simulated galaxies. Moreover, at J ~ 28 — 28.5, the galaxy 
size is under-estimated for simulated objects with input Rh 
greater than 0.2 arcsec. Thus, it is reasonable to think that 
the present z ~ 7 galaxies at J ~ 28 cannot give strong 
constraint to Rh mo de estimation, and the behavior of the 
size-luminosity relation for Mjjy > —19 is presently not 
robust. 

To check the reliability of the Rh mo d e determination 
at the faintest magnitudes, we have also investigated in 
the HUDF the power of the maximum likelihood test on 



the discrimination between different best fit solutions. The 
relatively large PSF of WFC3 with respect to the ex- 
pected size of galaxies at z=7 and Muv > — 19 can in- 
duce the reader to think that it is not possible to distin- 
guish between values of Rh mo de smaller that ~ 0.1 arc- 
sec at J ~ 28. We have verified for HUDF galaxies in the 
magnitude range 27.6 < J < 28.6 that an input distribu- 
tion with Rh moc i e = 0.02 arcsec has a likelihood parameter 
that is 3a off from the best fit (Rh mo d e = 0.08 arcsec), 
while if we choose Rh mo d e = 0.14 arcsec the likelihood test 
can reject this solution at 2a level. Thus, we can conclude 
that our simulations are able to distinguish between val- 
ues of Rh mo de smaller than the actual size of the WFC3 
PSF. As we discussed above, the main uncertainty on the 
size determination at Mjjv > —19 is the field-to- field vari- 
ance, with significantly different values between the HUDF 
(Rhmode = 0.09 ± 0.02 arcsec) and the P12HUDF field 
(Rhmode = 0.03 ± 0.03 arcsec). 

From these results we can draw another conclusion. 
While we could not exclude the presence of a non negli- 
gible population of faint and extended galaxies, it is clear 
that the typical (mode) physical dimension at L < L* is 
smaller than for more luminous gala xies. A similar result 



has been found on the HUDF also by Oesch et al. (2010b) 

We thus have found evidence for a size-luminosity 
relation (in the sense that bright galaxies can be ex- 
tended while faint galaxies are always compact/small) 
at high redshifts, confirming the general trend found 
by star forming galaxies (LBGs, LAEs) at lower-z 
flNagy et al. 20lT| IBond et al. 20121 ILaw et al. 2012)) . 
This is also in agreement with the works of 



Vanz ella et al (2009 
Malhotra et al. (201 



Pentericci et al. (2010) 
they found that LAEs at z ~ 3 — 1 
are in general smaller than the LBG sample and fainter 
in the UV continuum. They also showed that galaxies 
with small or negative EW in Lyman- a span a wide range 
of sizes and UV continuum luminosities, while large EW 
objects tend to be very small and faint in Mjjv> m practice 
line emitters tend to be small galaxies, while amongst 
LBGs there are both small and large galaxies. 

A comparison can be carried out at this stage between 
the size/luminosity relation of LBG s at z ~ 7 with the one 



of star-forming galaxies at lower-z. |Papovich et al. (2005) 
found that the typical size of spiral galaxies with Muv ~ 
—21 at z ~ 2 — 3 is ~ 2kpc, slightly larger than our de- 
termination (~ 0.8 kpc) at the same luminosity. We can 
thus infer that the size-luminosity relation of star-forming 
galaxies is evolving slowly from z = 7 to z = 2, in agree- 
ment with the (1 + z)^ 1 evolution of Rh with redshift 



found by Bouwens et al. (2004) at lower-z and confirmed by 
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Fig. 10. The redshift evolution of the size-luminosity rela- 
tion from z ~ to z =7. The cyan empty star s show the 
local relation found by de Jong fc Lacey (2000)| taking the 
absolute magnitudes in the I band from their work and ap- 
plying a co nstant shift Mjjy — M j = 1.0. The blue line is the 
relation by Roche et al. (1996) for irregular galaxies at z ~ 
0.5, the empty squares and triangle s represent galaxies at 
0.7 < z < 1.4 and 1.9 < z < 3.0 by |Papovich et al. (2005)] 
and the pentagons are the typical half light radii of LBGs 
at z=2.5 (magenta, top), 3.8 (green), 4.9 (brown), and 6.0 
(grey, bottom) by Bouwens et al. (2004) The black dots 
with error bars are the z=7 relation found in this work. 



Hathi et al. (2008)| at z 
at z ~ 7. 



5-6 and by Oesch et al. (2010b) 



Local irregulars and spirals are characterized b y a sim- 

derived 

a relation between the half 



le lumi- 



_ht radius and t 
nosity at z ~ 0.5 of Rh cx L 1 / 2 . Fig|TU] summarizes 
the size-luminosity relations at various redshifts described 
above, showing a continuous evolution in Rh from z=7 
(ou r points) to z=l-3 (th e empty squares and triangles 
by Papovich et al. (2005) I. At l ower redshift the s itua- 
tion is currently not clear, with Roche et al. (1996) find- 
ing a steep relation between the half light radius and 
the luminosity of galax ies at z=0.5, Rh cx L 1 ' 2 , while 
de Jong & Lacey (2000) measured a flatter relation Rh cx 
L l/i , and slightly lower in normalization at M\jy = —21 for 
the local galaxies. Despite this discrepancy at lower-z, the 
normalization of the local size-luminosity relation is higher 
at low-z than at z ~ 2 — 7: a spiral/irregular galaxy with 
Mjjv ~ — 21 has a physical (proper) size of 6-10 kpc at 
z < 1, compared to 2 kpc at z ~ 2. This implies that a sig- 
nificant evolution on the size/luminosity relation happened 
in the last 8 Gyr of the lifetime of the Universe, in contrast 
with a slow growth at z > 2. 



6. Discussion 

6.1. The impact on the reionization process 

The presence of a size/luminosity relation at z ~ 7 has im- 
portant implications for the luminosity function of LBGs 
at z ~ 7 and the role of stars on the reionization of the 
Universe. The observed lack of extended galaxies at faint 
magnitude limits (Rh ~ 0.2 arcsec at J > 27) implies 
that the typical half light radius is less than ~0.1 arcsec 

(Rhmode) ■ 

The present data on individual fields are not able to 
give stringent constraints on the amplitude of the size dis- 
tribution (o-Rh), as described above. We explore here the 
possibility to sum up the likelihood regions for all the fields 
probing the faint side of the magnitude distribution, namely 
the ERS, GDS, P12HUDF and HUDF, in the magnitude 
range 26.6 < J < 28.6. Adding together the outputs of 
these four fields, we derive a best fit of Rh mo d e — 0.09±0.01 
arcsec and a ctra = 0.06 ± 0.04. We thus explore the de- 
pendence of the slope of the faint end of the z ~ 7 LBG 
Luminosity Function a on the shape of the half-light radius 
distribution. We fix the parameters Rh mo d e to 0.09 and u^h 
to 0.06, which are the best fit values found above for the 
combination of all the faint fields. We compute the number 
density of z ~ 7 LBG s using the stepwise technique as in 
|Grazian et al (2011)1 for the HUDF and P12HUDF fields 
separately and plot the results in Fig[TTJ At faint absolute 
magnitudes, Mjjy ~ —18, the field to field variation in the 
number counts is comparable with the statistical errors (a 
combination of Poisson noise and uncertainties due to the 
conversion of observed J band magnitudes into absolute val- 
ues taking into account the output of our simulations). In 
particular, in the P12HUDF field an excess of faint z ~ 7 
galaxies have been detected, which results in a relatively 
steep LF for this field. We have checked that this enhance- 
ment is not due to an artificial overcorrection due to an 
excess of false positive rejection rate at faint magnitudes in 
the P12HUDF field. The number of simulated galaxies at 
z ~ 7 which are not recovered by our criteria in this field 
is 11% and it is similar to that found in HUDF (12%). 

We have fitted with a Schechter function 
(jSchechter 1976|) the individual stepwise results for 
HUDF and P12HUDF adding the LF determinat ions 
at Mjjv < —20.5 discussed in Grazian et al. (2011)| and 
shown in FigfrT] with filled symbols. Fixing <!>* and M* 
to the values provided in Grazian et al. (201 1)[ namely 
0.00074 and -20.14, we obtained a = -1.65 ± 0.09 and 
-1.83±0.18 for the HUDF and P12HUDF, respectively. We 
can derive a mean value for a by averaging the number 
counts in the HUDF and P12HUDF fields. We thus 
obtained a — —1.7 ± 0.1, where the large uncertainties 
reflects the strong field to field variation affecting the 
present data. 

If we consider Rh moc i e — 0.09 arcsec and a^h — 0.48, 
we obtain the most extended size distribution still al- 
lowed at the 95% confidence level by the combined like- 
lihood regions in the magnitude range 26.6 < J < 28.6. 
Using this distribution into the simulations, we derived 
two luminosity functions at z ~ 7 for the HUDF and 
P12HUDF fields as described above, and fitting them with 
a Schechter function we obtained a = —1.87 ± 0.12 and 
a = —2.05 ± 0.23, respectively. Combining the two fields, a 
best fit of a = —1.95 ± 0.15 has been derived. We thus 
confirm the anti-correlation between the parameter tjRh 
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Fig. 11. The Luminosity Function of LBG galaxies at 
z ^ 7 computed with the Stepwise method and assuming 
a log-normal size distribution with parameters Rh mo d e = 
0.09 ± 0.01 arcsec and a Rh = 0.06 ± 0.04. The filled sym- 



2009; 



red 
dots by 



bols show the data points presented i n|Grazian et al. (2011) 
(black dots and upper limits by lOuchi et al 
squares by |Castellano et al. (2010b) and cyan 
Bouwen s et al. 2010pp . while the open points are the num. 
ber densities for the HUDF (triangles) and P12HUDF (cir- 
cles). The cosmic variance between the two fields is greater 
than the statistical uncertainties. 



an d the resulting steep ness a of the LF, already found 



Grazian et al. (2011) 



that the faint end of the 



In our previous paper, we found 
z ~ 7 LBG Luminosity Function 
~-1.7 (see their Fig. 8), when a compact morphology 
< 0.15 arcsec) is adopted. For more extended mor- 



al is 
(Rh 

phologies (Rh ~ 0.25 arcsec) a steeper LF was derived 
(a ~ —2), confirming these results. 

A plausible value for the relevant UV emissivity of LBGs 
at z = 7, puVi can be computed by integrating the present 
z=7 LF with a = —1.7 down to M1500 = —10, assuming 
that the steepness of the faint end of the LF remains con- 
stant down to fluxes significantly fainter than that reached 
by our deepest images, the HUDF and P12HUDF ones. 
Using M* = -20 .14, Log®* = -3.13 for z=7 as in 
obtain a luminosity density of 
1 Hz -1 Mpc~ 3 corresponding 



Grazian et al. (2011) 



we 



-10) = 1.1-10+^ erg s 



Puv{ 

to a = —1.7 (the 1 a lower and upper bounds for this quan- 
tity are 8.1 • 10+ 25 and 1.5 • 10+ 26 erg s" 1 Hz' 1 Mp C - 3 , 
corresponding to a — —1.6 and a = —1.8, respectively). 
We have no information on the number density of z ~ 7 
galaxies at these faint magnitudes (Muv = —10) with the 
present data, so it is useful to stress that this is a very 
strong extrapolation. 

Given these confi dence limits on puy a nd adopting the 
same assumptions of Grazian et al. (2011) in order to have 
the Universe ionized at z = 7 we derive a lower limit for 
the Lyman Continuum escape fraction f esc > 0.14 x Chii 
for a = —1.7, where Chii is the dumpiness of the IGM 
at z=7. Taking into account the uncertainties on the faint 
end of the LF, this translates into f esc > 0.10 x Chii for 



a = -1.8 and f esc > 0.19 x Chii for a = -1.6 (at 68% 
c.l.), respectively. 

In the following, we will consider the implications of a 
luminosity function at z=7 with a — —1.8, in order to re- 
lax the constraints on the other two parameters, f esc and 
Chii- Assuming a maximum escape fraction of 1.0, the 
above limits can be converted into constraints to the IGM 
at z=7, Chii < 10.0, provided that the only source of UV 
photons are stars and the Universe is fully ionized at z=7. 
Of course, smaller values for the dumpiness Chii are re- 
quired if the escape fraction is less than 100% or the LF is 
flatter than a — —1.8. We must consider Chii > 1, since 
galaxies at z ~ 7 are formed in biased density regions of 
the Universe and the IGM is not homogeneous (Chii = 1) 
at these redshifts. Limiting the dumpiness to Chii > 1, 
we thus have f esc > 0.10 in order to have the Universe 
reionized by z=7. Bolton & Haehnelt (2007) have inferred 
Chii < 3 at z ~ 6 from the Lyman-a forest photoioniza- 
tion state: since Chii is expected to monotonically decrease 
towards high-z in a hierarchical Universe, an escape frac- 
tion of f esc ~ 0.3 is enough for stellar ionizers to reach the 
reionization of the IGM at z=7. A similar result has been 
derived bv lShull et al. 20T21 based on hydrodynamical and 
N-body simulations, of Chii ~ 1.5 — 3.0 at z > 6. Their 
model has more free parameters (the electron temperature 
of the IGM, the IMF of the stellar population producing 
UV photons) than considered here, and they derived that 
an escape fraction of 20% is enough at z=7 to keep the 
Universe ionized. 

It is worth noting that the recent estimates of f esc for 
L > L* LBGs at < z < 3 are < 10% ([Bridge et al. 2010 
ICowie et al. 20101 ISiana et al. 20101 IVanzella et al. 2010 
IBoutsia et al. 20TT|) : thus, assuming that the Universe is 
only re-ionized at z ~ 7 by stars in galaxies, this implies 
a fast increase of the galaxy escape fraction going to faint 
luminosities or to high redshifts (but see Nes tor et al. 20111 
for a larger estimate of the escape fraction of L* LBGs at 
z ~ 3). 

We neglect in our computation the contribution 
of AGNs, since their LF at z=7 is still unknown 
and the upper limits currently available indicate that 
the AGN will add only 5-8% to the luminosity den- 
sity of galaxies (f O uchi et al. 20091 ICowie et al. 20101 
|Haardt fc Madau (2011)[ ). These estimates however are de- 
rived extrapolating the behavior of very bright QSOs se- 
lected by SDSS (Muv ~ —26) down to very faint magni- 
tude limits. Recent results using the 4 Msec Chandra ob- 
servation in the GOODS-South region pointed out that the 
X-ray selected AGNs present a rather steep LF toward faint 
magnitudes (jFiore et al. 201 1| . they are characterized by a 
significant escape fraction (half of them have f esc ~100%, 
see [Vanzclla et al. 2010) and thus they could be the main 
rcsponsibles of the reionization process at z > 6. 

Summarizing, we have found here that the number den- 
sity of faint LBGs at z ~ 7 can be fitted with a Schechter 
luminosity function with a faint end slope of a = — 1.7±0.1, 
assuming a size distribution with Rh. mo d e — 0.09 ±0.01 arc- 
sec and (jRh — 0.06 ± 0.04. The value of the parameter a 
depends critically o n the size distribution of faint LBGs, as 
we already found in Grazian et al. (2011)| and the slope of 
the LF at z=7 can be steeper if the half-light radii of LBGs 
at this redshift extend towards larger values. We have de- 
tected the large field-to-field variation in the number den- 
sity of faint galaxies at z ~ 7, but this does not prevent us 
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from deriving a limit on the steepness of the z-dropout LF, 
a = —1.7 ± 0.1 at 68% confidence level. Using this limit, 
the Universe can be reionized by galaxies at large redshifts, 
only if their escape fraction is larger than ~ 30%. 

A plausible conclusion is that the number of faint 
galaxies at z ~ 7 in the Universe is enough to re- 
ionize the Universe (a ~ —1.7) only when extrapolat- 
ing the present LF down to M1500 = —10, assuming a 
combination of small dumpiness for the IGM and rela- 
tively high escape fraction of Lyman continuum photons 
(fesc > 0.14 x Chii)- A similar co nclusion has been reached 



also by Finkelstein et al. (2012) Since all these conditions 
are extreme assumptions, we start posing some doubts 
that the galaxies alone at z ~ 7 are able to keep the 
Universe ionized without the additional contributions of 
faint AGNs (IFiore et al. 2011|) or other more exotic ex- 



planations QDopita et al. 20ilj Conroy & Kratter (2012) I. 
Of course, invoking a dramatic increase of the escape 
fraction with redshift f esc oc (1 + z) 3A as done in 



Haardt & Madau (2011) could be an alternative solution 
to alleviate this problem. 



6.2. Is this picture conclusive ? 

The picture of the size-luminosity relation sketched in this 
work might not be conclusive, for a number of reasons: our 
results are based on photometrically selected candidates 
and they could be contaminated by lower-z interlopers; the 
bright side of the distribution is based only on a single field 
(UDS) and only half of the EGS region, and it is prob- 
ably affected by the cosmic variance effect, which should 
be stronger for more luminous galaxies; the morphology 
adopted in this work (disk galaxies with exponential pro- 
file) cannot be representative of the real galaxy shapes, es- 
pecially at z ~ 7 in the UV rest-frame, where the star- 
forming galaxies are clumpy and irregular; the distribution 
for the axial ratio parameter b/a assumed in this work (fiat 
from to 1) may not b e representative of the real one (see 



Ferguson et al. (2004) [ ); 

While we acknowledge that our analysis is not decisive, 
we tend to believe that these results are robust for a number 
of reasons. 

All the six fields used in this work are characterized by 
a combination of deep multi- wavelength photometry, which 
ensures a clean sample of candidate galaxies at z ~ 7. 
The spectroscopic confirmations of the bright candidates 
in the GOODS fields, and in other ground based surveys, 
is currently ongoing (jFontana et al. 201Q[IStark et al. 20TT1 
IVanzella et al. 20111 IQno et al. 2012[) and the fraction of 
lower-z interlopers seems to be less than 20% at z ~ 6 
(jPentericci et al. 201 ip . Thus we are confident that the 
sample adopted for studying the size-luminosity relation in 
this paper is not heavily contaminated by lower-z galaxies. 

At the moment, the bright side of the z ~ 7 galaxy 
population is sampled mainly by the UDS field, so the ro- 
bustness of the size-luminosity relation is based only on this 
region. Since the cosmic variance effect should be stronger 
for more luminous galaxies, one can suspect that it is only 
by chance that bright galaxies in the UDS are also the 
largest, when compared to the ones in deeper fields (i.e. 
HUDF). However, at J < 25.8 we have 7 galaxies with 
Rh > 0.2arcsec out of a total of 14 objects (50%) in the 
combined UDS and EGS fields, which is hard to be inter- 
preted only as a simple cosmic variance effect. In the near 



future the CANDELS-Wide survey will complete the EGS 
and cover the COSMOS field at a similar depth and areal 
coverage of the UDS, thus enhancing by a factor of two the 
number of galaxies in the bright side of the size-luminosity 
relation and reducing the uncertainties on Rh mo de- 

The size-luminosity relation measured in this work 
could also be an artifact due to the frequent merging of 
galaxies expected at very high-z. Instead of measuring a 
physical size, the large Rh found for relatively bright ob- 
jects could be an estimate for the separation of their clumps 
during the merging phase. Detailed kinematic analysis with 
IFU spectroscopy at Extremely Large Telescopes (ELTs) of 
30-40 meters of diameter, as done currently with 8m class 
telescopes on z ~ 2 galaxies (|Forster-Schreiber et al. 2006, 
ILaw et al. 2012[) , would distinguish these two plausible hy- 
potheses. 

A more reliable distribution for the ax ial ratio param- 
eter b/a, following Ferguson et al. (2004) can be adopted 
in our simulations, but we must stress that, in their work, 
this functional form is measured on a sample of photomet- 
rically selected candidate galaxies at z ~ 4, and it is not 
clear whether this should be applied also to our z ~ 7 
sample. A more detailed approach, like that adopted in 
ILaw et al. 20121 work, is more appropriate but very com- 
plex and goes beyond the aims of this paper. 



7. Conclusions 

Galaxy sizes (half light radii) have been measured 

for a sample of 153 galaxy candidates at z ~ 

7 from the CANDELS HST Multi-Cycle Treasury 



Program ( |Grogin et al. (2011) _ Koekemoer et al. (2011)[ ) 



and HUDF09 project QBouwens et al. (2011)p . In partic- 
ular, we have used the deep HST imaging database in 
BVIZYJH bands for the ERS, GDS, P12HUDF and HUDF 
fields together with the wide area observations in the UDS 
and EGS fields in the VIJH bands. We select the galaxy 
candidates at z=7 through the classical z-dropout tech- 
nique, which has been verified by deep VLT spectroscopy 
(jPentericci et al. 201 ip . For the UDS and EGS we use the 
HST igi4 band as dropout to select high-z galaxy candi- 
dates. 

Despite the difficulties of measuring galaxy morphology 
at z ~ 7, thanks to detailed and extensive simulations, we 
successfully detect a clear size-luminosity relation for LBGs 
at high-z. In particular, we found evidences that: 

— bright galaxies can be large. At magnitude brighter 
than J = 26.6 (corresponding ~ L* at z=7) galax- 
ies have been observed at larger dimensions (Rh ~ 
OAarcsec or equivalently 2.3 kpc proper) than at faint 
magnitudes. Again, Figj8]shows the extended tail in Rh 
which is present only for bright galaxies: despite all the 
deeper fields are sensitive to such extended galaxies, 
none of them have been found at J > 26.6. 

— faint galaxies are small. At J > 26.6 the observed 
sizes of z=7 galaxies are smaller than 0.2 arcsec (cor- 
responding to 1.15 kpc proper). This is evident looking 
both at Fig[5]and as a result of the detailed simulations 
summarized by Table [TJ 

a size-luminosity relation is already in place at 
z=7. The observed dependency of LBG sizes from 
galaxy luminosity at z=7 has been shown in FigJHl We 
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have found that a relation exist between these two ob- 
servables, Rh oc L 7 , with 7 ~ 1/3 — 1/2. 

These results have deep implications for our under- 
standing of the reionization of the Universe. The derived 
size-luminosity relation at z ~ 7 and the fact that faint 
LBGs have typical half light radii of ~0.1 arcsec seems 
to indicate that the slope of the z=7 LF is not extremely 
steep, due to the cor relation between size and a found in 
Grazian et al. (2011) and recovered here. We detect also 
a strong field to field variation in the faint regime, with 

a 1.6 in the HUDF and a 1.8 in the P12HUDF 

field. Using an average value for the two samples, we de- 
rived a = — 1.7±0.1. The relevant UV emissivity of LBGs at 
z = 7, puv, has been computed by integrating the best fit 
LF down to Mi 50 o = —10, and resulted in p uv = 1.1 ■ 10 +26 
erg s~ l Hz -1 Mpc~ 3 . This amount of radiation is not 
able to keep the Universe re-ionized if the IGM is clumpy 
(Chii > 3) and if the Lyman continuum escape fraction of 
high-z LBGs is relatively low (f esc < 0.3). The only con- 
figuration that allows a non-neutral Universe due to stellar 
ionizers in galaxies is if the LBG LF is steeper than —1.7, 
combined with a small dumpiness for the IGM and a high 
escape fraction of Lyman continuum photons. These are 
no t implausible conditions (see IBolton fe HaehneE 2007 
or Haardt & Madau (2011)), but of course they are ex- 
treme assumptions, that could be overcome by simpler ex- 
planations, like an additional contribution of faint AGNs 
(see IFiore et al. 20110 , or more exotic explanations (see 
|Dopita et al. 20110 . 

Our results on the size distribution and size-luminosity 
relation of z = 7 LBGs have been investigated through 
detailed and realistic simulations and thus they are robust 
conclusions. However, the dataset used, especially at the 
bright side (UDS), is not free from biases due to cosmic 
variance effects, and we cannot exclude that subtle effects 
can modify our results. Investigating wide fields searching 
—20 galaxies will be very useful to 
CANDELS-W ide s urvey, 
2007|) and 



uv 



< 



for -21 < 

reinforce our statements. The 
by observing the COSMOS (IScoville et al 



the whole EGS (|Davis et al. 20070 fields, for a total of 
other 300 sq. arcmin down to J=26.7, will provide ~ 70 
additional bright candidates at z = 7, and would be able to 
beat down the cosmic variance affecting the bright side of 
z = 7 distribution. In addition, WFC3 imaging of Y ~ 25 
z-dropout galaxies found with large area ground based 



imaging (IQuchi et al. 20091 |Castellano et al. (2010a)| 



Castellano et al. (2010b)| IBowler et al. 20120 will provide 



useful information on the bright side of the size-luminosity 
relation not yet covered by the present observations. 

Moreover, the CANDELS Deep survey plus the HUDF 
ultradeep fields (both those already observed and the HST 
program recently approved for Cycle 19) will extend these 
results and confirm in a more accurate statistical evidence 
the trend of the size-luminosity relation at z ~ 7. In par- 
ticular, the CANDELS-Deep region on the two GOODS 
fields, covering 150 sq. arcmin down to J ~ 28, will open 
a very interesting window on the exact shape of the size 
distribution down to very large half light radii. In addi- 
tion, the combination of depth and area guaranteed by the 
CANDELS-Deep survey will decrease the uncertainties on 
the faint side determination of the z ~ 7 LF due to cosmic 
variance. 



Clearly, the size-luminosity relation found here is the 
simplest correlation between physical properties of galaxy 
at z ~ 7. With a full spectroscopic sample in hand, cou- 
pled with the deep multi-wavelength dataset available for 
the CANDELS survey, it will be possible to explore the de- 
pendencies of the half light radius with the galaxy stellar 
mass, SFR, dust extinction or the EW of Ly-a, as already 
done for star-forming galaxies at smaller redshifts. 
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Table A.l. z ~ 7 galaxy candidates in the GDS field Table A. 3. z ~ 7 galaxy candidates in the UDS field 



ID 


RAD 


DEC 


J 


Rh( 


axcsec) 


565 


53 


074491 


-27.877560 


26.77 





167 


959 


53 


079016 


-27.873006 


27.07 





187 


1240 


53 


072728 


-27.869255 


26.48 





258 


1710 


53 


053794 


-27.864331 


27.33 





175 


1903 


53 


101737 


-27.862297 


26.50 





116 


2197 


53 


086466 


-27.859149 


26.99 





140 


2610 


53 


123502 


-27.855121 


27.67 





199 


2613 


53 


112600 


-27.855104 


27.15 





264 


2721 


53 


091561 


-27.853837 


26.66 





204 


3331 


53 


106017 


-27.848158 


26.54 





161 


6444 


53 


101396 


-27.820836 


27.64 





131 


6483 


53 


099068 


-27.820457 


26.17 





321 


6619 


53 


036200 


-27.819195 


27.64 





142 


7015 


53 


064118 


-27.815569 


27.33 





160 


7817 


53 


058939 


-27.808653 


27.60 





189 


8899 


53 


066929 


-27.799365 


26.95 





215 


9705 


53 


052408 


-27.791223 


25.73 





149 


10362 


53 


096474 


-27.787018 


26.99 





134 


10453 


53 


083067 


-27.786274 


27.80 





134 


11438 


53 


033874 


-27.778009 


26.32 





188 


13233 


53 


083367 


-27.764388 


27.28 





119 



Table A.2. 



7 galaxy candidates in the P12HUDF field 



ID 


RAD 


DEC 




r 


Rh(arcsec) 


173 


53 


263422 


-27 


703536 


28 


50 





189 


330 


53 


264582 


-27 


700250 


27 


26 





352 


337 


53 


258939 


-27 


700140 


28 


40 





140 


355 


53 


255364 


-27 


699911 


28 


12 





235 


686 


53 


266042 


-27 


693785 


28 


60 





197 


801 


53 


260144 


-27 


692035 


27 


51 





141 


1059 


53 


248301 


-27 


689142 


27 


19 





187 


1212 


53 


283644 


-27 


687548 


27 


72 





171 


1213 


53 


249308 


-27 


687531 


28 


29 





115 


1271 


53 


265814 


-27 


686757 


28 


79 





123 


1278 


53 


284528 


-27 


686671 


27 


93 





106 


1364 


53 


236245 


-27 


685609 


27 


01 





176 


1403 


53 


232299 


-27 


685280 


27 


96 





104 


1404 


53 


240862 


-27 


685269 


28 


43 





149 


1545 


53 


234488 


-27 


683431 


28 


41 





134 


1634 


53 


253769 


-27 


682253 


28 


24 





193 


1744 


53 


245689 


-27 


680663 


27 


41 





190 


1750 


53 


245773 


-27 


680770 


28 


68 





137 


1988 


53 


254990 


-27 


677560 


28 


78 





122 


2005 


53 


248703 


-27 


676469 


25 


94 





162 


2070 


53 


238318 


-27 


676401 


28 


10 





115 


2226 


53 


243749 


-27 


673300 


27 


85 





122 


2261 


53 


244625 


-27 


672749 


28 


65 





129 



ID 


RAD 




DEC 




I 


Rh( 


ircsec) 


zphot 


2047 


34 


280510 


-5 


2683411 


26 


59 





151 


yes 


2635 


34 


488762 


-5 


2656941 


25 


72 





184 


no 


2638 


34 


488934 


-5 


2657399 


26 


64 





140 


no 


2782 


34 


424973 


-5 


2652078 


26 


54 





156 


yes 


3147 


34 


305576 


-5 


2637830 


26 


43 





176 


no 


4101 


34 


389111 


-5 


2595940 


26 


49 





174 


no 


4154 


34 


422218 


-5 


2592878 


25 


92 





182 


yes 


6374 


34 


475731 


-5 


2484989 


25 


62 





159 


no 


6409 


34 


280376 


-5 


2483692 


25 


70 





326 


yes 


6443 


34 


482029 


-5 


2481699 


25 


66 





148 


yes 


9386 


34 


321495 


-5 


2355309 


26 


23 





214 


yes 


10527 


34 


415421 


-5 


2307229 


26 


02 





179 


yes 


13324 


34 


367481 


-5 


2190270 


26 


18 





239 


no 


13796 


34 


224491 


-5 


2169590 


26 


04 





244 


no 


14435 


34 


323608 


-5 


2141371 


26 


57 





172 


no 


15399 


34 


233883 


-5 


2100158 


25 


43 





241 


yes 


15482 


34 


253719 


-5 


2095661 


26 


00 





160 


yes 


16119 


34 


253719 


-5 


2068028 


26 


30 





189 


yes 


16669 


34 


279049 


-5 


2043710 


26 


48 





148 


yes 


16910 


34 


226192 


-5 


2033339 


26 


50 





121 


yes 


16974 


34 


313725 


-5 


2030821 


26 


02 





247 


yes 


17160 


34 


313225 


-5 


2023530 


26 


37 





180 


no 


18356 


34 


325001 


-5 


1977878 


26 


59 





160 


no 


18777 


34 


283592 


-5 


1959820 


26 


34 





186 


yes 


18797 


34 


278481 


-5 


1958232 


26 


16 





145 


no 


18932 


34 


482838 


-5 


1953049 


26 


17 





158 


yes 


19136 


34 


467083 


-5 


1944618 


26 


25 





193 


yes 


19390 


34 


434162 


-5 


1934099 


26 


10 





170 


no 


19476 


34 


354675 


-5 


1930408 


26 


03 





179 


yes 


19818 


34 


376148 


-5 


1916552 


26 


55 





156 


yes 


21850 


34 


408279 


-5 


1825972 


25 


53 





410 


no 


23427 


34 


298386 


-5 


1760311 


25 


83 





234 


yes 


24592 


34 


359615 


-5 


1711669 


25 


58 





115 


yes 


25519 


34 


308323 


-5 


1673961 


26 


65 





147 


yes 


26120 


34 


443848 


-5 


1646738 


26 


21 





196 


yes 


26125 


34 


337284 


-5 


1646609 


26 


35 





151 


yes 


27601 


34 


447922 


-5 


1583772 


25 


86 





224 


no 


28737 


34 


229103 


-5 


1533098 


25 


97 





210 


yes 


29249 


34 


226135 


-5 


1510921 


25 


98 





143 


no 


29272 


34 


493279 


-5 


1510839 


26 


12 





160 


no 


30492 


34 


490051 


-5 


1458049 


25 


79 





188 


yes 


32650 


34 


479736 


-5 


1365409 


26 


61 





148 


no 


33062 


34 


413895 


-5 


1347852 


25 


86 





179 


yes 


33622 


34 


269180 


-5 


1324148 


26 


28 





188 


no 


33684 


34 


446419 


-5 


1321688 


26 


70 





122 


no 


34364 


34 


313526 


-5 


1293492 


25 


68 





196 


yes 



We mark with the label zph ot = yes the 27 candidates selected 
by McLure et al. (in prep.) with a robust photometric redshift 
consistent with z > 7. 



Appendix A: z ~ 7 galaxy candidates in the GDS, 
P12HUDF, EGS and UDS fields 

The z ~ 7 candidate galaxies i n the ERS and HUDF fields 
have been already presented in Grazian et al. (2011) Here 
we provide the lists of the z-band drop-out candidates for 
the GDS (Table [OJ, P12HUDF (Table EU, and 1814- 
band dropout from the UDS (Table [O]) and EGS (Table 
[Oil fields. In Table [O] (^Oj) we mark the 27 (13) can- 
didates selected by |McLure et al. (in prep.) with a robust 
photometric redshift consistent with z > 7 in the UDS and 
EGS fields, respectively. 



Appendix B: Output of completeness simulations 
in the EGS, ERS, GDS, P12HUDF, and HUDF 
fields 



Fie lB.ll shows the results of the simulations carried out in 
the UDS field. A Sersic profile of index n = 4 has been 
adopted as input morphology for our simulations. This plot 
shows the comparison between the half-light radius (in arc- 
sec) as measured by SExtractor as a function of the in- 
put one, for the magnitude range 21 < J < 24 (top) and 
24 < J < 25 (bottom). Fig lB.2l provides the same infor- 
mation for simulated galaxies in the HUDF field, in the 
range 25.5 < J < 26.5 (top) and 26.5 < J < 27.5 (bot- 
tom). The measured size of the objects cannot be smaller 
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Table A.4. 



7 galaxy candidates in the EGS field 



ID 



RAD 



DEC 



Rh(arcsec) zphot 



1976 

4022 

5970 

6168 

6845 

7908 

8053 

8371 

10433 

10671 

10865 

13537 

14449 

14516 

15250 

15423 

15937 

19048 

19761 

20105 

21147 



214.854416 
214.732269 
214.779892 
215.091064 
215.066467 
215.188446 
215.145386 
214.710114 
215.081482 
215.060745 
214.705551 
214.677643 
214.946732 
214.945587 
214.998825 
215.130081 
215.095123 
214.670349 
215.077621 
214.628082 
214.863052 



52.759621 
52.685204 
52.732086 
52.954372 
52.941925 
53.033737 
53.004257 
52.696918 
52.972218 
52.958767 
52.707199 
52.703339 
52.900570 
52.900261 
52.942127 
53.035568 
53.014256 
52.731167 
53.026142 
52.709671 
52.889496 



25.52 
25.81 
26.26 
26.62 
25.64 
26.25 
25.32 
26.62 
25.43 
26.56 
26.26 
26.22 
26.64 
26.13 
26.11 
26.57 
25.46 
26.40 
26.54 
26.03 
26.19 



0.210 
0.199 
0.152 
0.149 
0.303 
0.205 
0.196 
0.160 
0.203 
0.162 
0.199 
0.228 
0.159 
0.146 
0.248 
0.141 
0.218 
0.149 
0.145 
0.225 
0.182 



no 
yes 
no 
yes 
yes 
no 
no 
yes 
yes 
yes 
yes 
no 
no 
yes 
yes 
yes 
yes 
yes 
no 
no 
yes 



We mark with the label zph ot — yes the 13 candidates 
by McLure et al. (in prep.) with a robust photometric 



selected 
redshift 



consistent with z > 7. 



than the instrumental PSF (which is about 0.18" of FWHM 
in the J band, corresponding to an half light radius of 
0.11"). Because of the convolution with the PSF, all ob- 
jects intrinsically smaller than ~ 0.2" are biased high by 
the SExtractor estimate. 




Fig. B.l. The results of our simulations for z ~ 7 galax- 
ies with Sersic profiles of index n = 4 in the UDS field. 
The plot shows the comparison between the half-light ra- 
dius (in arcsec) as measured by SExtractor as a function of 
the input one. The upper and lower panel refer to different 
total magnitudes, as reported in the legend. The red line 
shows the identity relation. The blue points and errorbar 
shows the average value and the relevant r.m.s. of the out- 
put half-light radius. At small sizes the output half light 
radius is typically larger than the input one due to the con- 
volution with the instrumental PSF carried out during the 
simulations. 



Fig |B.3l shows the observed J-band magnitudes versus 
the measured sizes for simulated (small dots) and observed 
galaxies (red triangles) at z ~ 7 for the EGS, ERS, GDS, 
P12HUDF and HUDF fields. The solid blue line shows the 
50% completeness level for an input simulated profile of disk 
galaxy. Detailed explanations on the procedure adopted in 
these simulations can be found in section 4.1. 



Appendix C: Best Fit 

The confidence levels for the maximum likelihood analysis 
of the size distribution on the combined ERS and GDS 
fields (intermediate sample), and the combined P12HUDF 
and HUDF fields (faint sample). The best fit is indicated 
by the magenta point while the green, blue, and red regions 
define the uncertainties at 68%, 95%, and 99.7% (1,2, and 
3 sigma) confidence level, respectively. 



Grazian et al.: The size- luminosity relation at z ~ 7, Online Material p 3 




Fig. B.3. The observed J magnitude vs size of simulated (small dots) and observed galaxies (red triangles) at z ~ 7 for 
the EGS, ERS, GDS, P12HUDF and HUDF fields. 



Grazian et al.: The size- luminosity relation at z ~ 7, Online Material p 4 




Rh mode (arcsec) Rh mode (arcsec) 

Fig. C.l. The composite likelihood distribution for the intermediate (26.6 < J < 27.6, left) and faint (27.6 < J < 28.6, 
right) magnitude bins. The best fit is indicated by the magenta point while the green, blue, and red regions define the 
uncertainties at 68%, 95%, and 99.7% (1,2, and 3 sigma) confidence level, respectively. 




Fig. B.2. The results of our simulations for z ~ 7 galax- 
ies with Sersic profiles of index n = 4 in the HUDF field. 
The plot shows the comparison between the half-light ra- 
dius (in arcsec) as measured by SExtractor as a function of 
the input one. The upper and lower panel refer to different 
total magnitudes, as reported in the legend. The red line 
shows the identity relation. The blue points and errorbar 
shows the average value and the relevant r.m.s. of the out- 
put half-light radius. At small sizes the output half light 
radius is typically larger than the input one due to the con- 
volution with the instrumental PSF carried out during the 
simulations. 



